RAW SEQUENCE LISTING 
ERROR REPORT 



BIOTECHNOLOGY J 
SYSTEMS 
BRANCH 




The 



(STIC) detected errors when processing the following CRF diskette: 



Application Serial Number 
Art Unit / Team No. : 
Date Processed by STIC: ' 




THE 

PLEASE BE SURE TO FORWARD THIS INFORMATION TO THE APPLICANTS i 



EITHER: 



1) INCLUDING A COPY OF THIS PRINTOUT IN YOUR NEXT 
COMMUNICATION TO THE APPLICANTS ALONG WITH A NOTICE TO 
COMPLY or, , 

2) CALLING APPLICANTS AND FAXING THEM A COPY OF THE PRINTOUT 
WITH A NOTICE TO COMPLY 

THIS WILL INSURE THAT THE NEXT SUBMISSION RECEIVED FROM THEM 
WILL BE ERROR FREE. 

IF YOU HAVE ANY FURTHER QUESTIONS, PLEASE CALL: 
MARK SPENCER 703-308-4212 



Raw Sequence Listing Error Summary 



ERROR DETECTED SUGGESTED CORRECTION 



SERIAL NUMBER 



: 09 //WW ? 



ATTN: NEW RULES CASES: PLEASE DISREGARD ENGLISH "ALPHA" HEADERS, WHICH WERE INSERTED BY PTO SOFTWARE 



Wrapped Nucleics 



Wrapped Aminos 



The number/text at the end of each line "wrapped" down to the next line. 
This may occur if your file was retrieved in a word processor after creating it. 
Please adjust your right margin to .3, as this will prevent "wrapping". 

The amino acid number/text at the end of each line "wrapped " down to the next line. 
This may occur if your file was retrieved in a word processor after creating it. 
Please adjust your right margin to .3, as this will prevent "wrapping". 



Incorrect Line Length The rules require that a line not exceed 72 characters in length. This includes spaces. 
Misaligned Amino Acid The numbering under each 5th amino acid is misaligned. This may be caused by the use of tabs 



Numbering 
5 Non-ASCII 

6 Variable Length 



Patentln ver. 2.0 "bug" 



Skipped Sequences 
(OLD RULES) 



Skipped Sequences 
(NEW RULES) 



10 ^ Use of n's or Xaa's 
(NEW RULES) 



between the numbering. It is recommended to delete any tabs and use spacing between the numbers. 

This file was not saved in ASCII (DOS) text, as required by the Sequence Rules. 

Please ensure your subsequent submission is saved in ASCII text so that it can be processed. 

Sequence(s) contain n's or Xaa's which represented more than one residue. 

As per the rules, each n or Xaa can only represent a single residue. 

Please present the maximum number of each residue having variable length and 

indicate in the (ix) feature section that some may be missing. 

A "bug" in Patentln version 2.0 has caused the <220>-<223> section to be missing from amino acid 
sequence(s) . Normally, Patentln would automatically generate this section from the 

previously coded nucleic acid sequence. Please manually copy the relevant <220>-<223> section 

to the subsequent amino acid sequence. 

Sequence(s) missing. If intentional, please use the following format for each skipped sequence: 

(2) INFORMATION FOR SEQ ID NO:X: 

(I) SEQUENCE CHARACTERISTICS:(Do not insert any headings under "SEQUENCE CHARACTERISTICS") 
(xi) SEQUENCE DESCRIPT10N:SEQ ID NO:X: 
This sequence is intentionally skipped 

Please also adjust the "(iii) NUMBER OF SEQUENCES:" response to include the skipped sequence(s). 

Sequence(s) missing. If intentional, please use the following format for each skipped sequence. 

<210> sequence id number 
<400> sequence id number 
000 

Use of n's and/or Xaa's have been detected in the Sequence Listing. 
Use of <220> to <223> is MANDATORY if n's or Xaa's are present. 

In <220> to <223> section, please explain location of n or Xaa, and which residue n or Xaa represents. 



11 



Use of <213>Organism 
(NEW RULES) 

Us of <220>Featur 
(NEW RULES) 



Patentln ver. 2.0 "bug" 



Sequence(s) . 



_ are missing this mandatory field or its response. 



Sequence(s) are missing the <220>Featur and associated headings. 

Use of <220> to <223> is MANDATORY if <213>ORGANISM is "Artificial" or "Unknown- 
Please explain source of genetic material In <220> to <223> section. 

(See "Federal Register," 6/01/98, Vol. 63, No. 104, pp. 29631-32) (Sec. 1.823 of new Rules) 

Please do not use "C py to Disk" function of Patentln version 2.0. This causes a corrupted 
fil , r suiting in missing mandatory numeric identifiers and responses (as indicated on raw sequence listing). 
Instead, please use "File Manager" or any other means to copy file t floppy disk. 
AKS-Biotechnology Systems Branch- 5/15/99 



1652 

PAGE: 1 RAW SEQUENCE LISTING DATE: 11/15/1999 

PATENT APPLICATION US/09/147,947 TIME: 14:07:40 

Input Set: I 14 7 94 7. RAW 



This Raw Listing contains the General Information 
Section and up to first 5 pages. 



1 <110> APPLICANT : TSURUOKA, Nobuo 0e S Not Comply 

2 YAMASHIRO, Kyoko 

corrected Diskette Needed 

3 YAMAGUCHI, Nozomi 

4 <12 0> TITLE OF INVENTION: Novel Serine Protease 

5 <130> FILE REFERENCE: 001560-349 

6 <140> CURRENT APPLICATION NUMBER: US/09/147 , 947 

7 <141> CURRENT FILING DATE: 1999-03-24 

8 <150> EARLIER APPLICATION NUMBER: PCT/ JP98/03324 

9 <151> EARLIER FILING DATE: 1998-07-24 

10 <150> EARLIER APPLICATION NUMBER: JP 9/213969 

11 <151> EARLIER FILING DATE: 1997-07-24 

12 <160> NUMBER OF SEQ ID NOS : 6 

13 <170> SOFTWARE: Patentln Ver. 2.0 

14 <210> SEQ ID NO 1 

15 <211> LENGTH: 20 

16 <212> TYPE: DNA 

17 <213> ORGANISM: Artificial Sequence 

18 <220> FEATURE: - 

19 <223> OTHER INFORMATION: Synthetic DNA ^ / J / % IT 

20 <400> SEQUENCE^ 1 J)jLk )^ f 0 (P^ £/W\ >#</^rWU/| jfyLuJ\ 

W--> 21 gtgctcacM cimcbcaytg 20 

22 <210> SEQ ID NO 2 ^ 

23 <211> LENGTH: 20 

24 <212> TYPE: DNA 

25 <213> ORGANISM: Artificial Sequence 

26 <220> FEATURE: 

27 <223> OTHER INFORMATION: Synthetic DNA 

28 <400> SEQUENCE: 2 ^ jp, 

W--> 2 9 agcggg^c^b cdgartcvcc AlA ^k^^ ' 20 

30 <210> SEQ ID NO 3 

31 <211> LENGTH: 2614 

32 <212> TYPE: DNA 

33 <213> ORGANISM: Mouse 

34 <220> FEATURE: 

35 <223> OTHER INFORMATION: 

36 <400> SEQUENCE: 3 

3 7 cgagggtggg gtggaggtcg gactccgggc tacagagctc ctggcgctca tcgcctctgg 60 

38 ctccagcctt tgcttcgcgg ggctgaccct ttgggtcccg gtgtgatcct ccagctgccc 120 

3 9 cgggggctgg gacagcaggg cggcggcgcg agcgtgggag ggggctctag gactctgccg 180 

40 gccccgcccc gccccctccg cggggacccg gagcccagca tggaccacac tcggcgccgc 240 

41 agcc atg gcg etc gcc cgc tgc gtg ctg get gtg att tta ggg gca ctg 289 

42 Met Ala Leu Ala Arg Cys Val Leu Ala Val lie Leu Gly Ala Leu 

43 1 5 10 15 

44 tct gta gtg gcc cgc get gat ccg gtc teg cgc tct ccc ctt cac cgc 337 



1652 



PAGE: 2 RAW SEQUENCE LISTING DATE: 11/15/1999 

PATENT APPLICATION US/09/147,947 TIME: 14:07:40 



Input Set: I 14 7 94 7. RAW 



45 


Ser 


Val 


Val 


Ala 


Ara 


Ala 


Asp 


Pro 


Val 


Ser 


Ara 


Ser 


Pro 


Leu 


His 


Ara 




46 










20 










25 










30 






47 


CCQ 


cat 


CCQ 


tec 


cca 


ccg 


cgt 


tec 


caa 


cac 


aca 

3 *-*3 


cac 


tac 


ctt 


ccc 


age 


385 


48 


Pro 


His 


Pro 


Ser 


Pro 


Pro 


Arq 


Ser 


Gin 


His 


Ala 


His 


Tvr 


Leu 


Pro 


Ser 




49 








35 










40 










45 








50 


tea 


ccrcr 


cqq 


cca 


ccc 


aqq 

u 3 3 


acc 


ccg 


cgc 


ttc 


ccg 


etc 


ccg 


ctg 


caa 

^33 


ate 


433 


51 


Ser 


Arq 


Arq 

3 


Pro 


Pro 


Arq 


Thr 


Pro 


Ara 


Phe 


Pro 


Leu 


Pro 


Leu 


Ara 

■"■4- ^ 


He 




52 






50 










55 










60 










53 


ccc 


get 


gee 


cag 


cgc 


ccg 


cag 


gtc 


etc 


age 


acc 


aaa 

333 


cac 


acg 


ccc 


ccg 


481 


54 


Pro 


Ala 


Ala 


Gin 


Ara 


Pro 


Gin 


Val 


Leu 


Ser 


Thr 


Glv 


His 


Thr 


Pro 


Pro 




55 




65 










70 










75 












56 


acg 


att 


cca 


cgc 


cgc 


tgc 


aaa 

3 33 


gca 


aaa 


aaa 


teg 


taa 

u 33 


aac 

33^ 


aat 


gee 


ace 


529 


57 


Thr 


He 


Pro 


Ara 


Ara 


Cvs 


Glv 


Ala 


Glv 


Glu 


Ser 


Trp 


Glv 


Asn 


Ala 


Thr 




58 


80 










85 










90 










95 




59 


aac 


etc 


acre 

33^ 


gtc 


ccg 


tgt 


eta 


cac 


taa 

^3 3 


gac 


aaa 


ata 

3 *-3 


ccg 


ccc 


ttc 


ctg 


577 


60 


Asn 


Leu 


Gly 


Val 


Pro 


Cvs 


Leu 


His 


Trp 


Asp 


Glu 


Val 


Pro 


Pro 


Phe 


Leu 




61 










100 










105 










110 






62 


aaa 


caa 


teg 


ccc 


ccg 


gee 


agt 


^33 


get 


aaa 


ctg 


caa 


aaa 

333 


cag 


ccg 


cac 


625 


63 


Glu 


Arq 


Ser 


Pro 


Pro 


Ala 


Ser 


Trp 


Ala 


Glu 


Leu 


Ara 


Glv 


Gin 


Pro 


His 




64 








115 










120 










125 








65 


aac 


ttc 


tqc 


caa 

*-*3 3 


age 


ccg 


gat 


aac 

33^ 


teg 


aac 

33^ 


aga 


cct 


taa 

u 33 


tgc 


ttc 


tat 


673 


66 


Asn 


Phe 


Cys 


Arq 


Ser 


Pro 


Asp 


Gly 


Ser 


Glv 


Ara 


Pro 


Trp 


Cys 


Phe 


Tvr 




67 






130 










135 










140 










68 


caa 


aat 


QCC 


caa 


ggc 


aaa 


gta 


gac 


taa 

^33 


aac 


tac 


tgc 


gat 


tgt 


aat 

33 


caa 


721 


69 


Arg 


Asn 


Ala 


Gin 


Gly 


Lys 


Val 


Asp 


Trp 

ir 


Gly 


Tvr 


Cvs 


Asp 


Cvs 


Glv 


Gin 




70 




145 










150 










155 












71 




ccg 


qca 

3^3 


ttg 


ccc 


gtc 


att 


cgc 


ctt 


gtt 


aat 

33 u 


aaa 

333 


aac 


agt 


aaa 

333 


cat 


769 


72 


Gly 


Pro 


Ala 


Leu 


Pro 


Val 


He 


Ara 


Leu 


Val 


Gly 


Glv 


Asn 


Ser 


Glv 


His 




73 


160 








165 










170 










175 




74 


gaa 


qqt 


cga 


ata 

3 ^*3 


gag 


ctg 


tac 


cac 


act 


aac 


cag 


taa 

u 33 


aaa 

333 


acc 


ate 


tgt 


817 


75 


Glu 


Gly 


Arg 


val 


Glu 


Leu 


Tyr 


His 


Ala 


Gly 


Gin 


Trp 


Gly 


Thr 


He 


Cvs 




76 










180 










185 










190 






77 


aac 


gac 


caa 


taa 

^3 3 


gac 


aat 


gca 


gac 


gca 


gac 


gtc 


ate 


tgt 


aaa 


cag 


ctg 


865 


78 


Asp 


Asp 


Gin 


Trp 


Asp 


Asn 


Ala 


Asp 


Ala 


Asp 

Ir 


Val 


He 


Cvs 


Ara 


Gin 


Leu 




79 








195 










200 










205 








80 


aaa 

33 3 


etc 


aat 


aac 

33 v 


att 


gee 


aaa 


aca 


taa 

^*33 


cat 


cag 


gca 


cat 


ttt 


aaa 

333 


gaa 


913 


81 


Gly 


Leu 


Ser 


Gly 


He 


Ala 


Lvs 


Ala 


Trp 

_r 


His 


Gin 


Ala 


His 


Phe 


Glv 


Glu 




82 






210 










215 










220 










83 










ata 


ttg 


1 1 CI 


era t 


y eld 




cgc 


tgc 


acc 


gga 


aac 


gag 




84 


Gly 


Ser 


Gly 


Pro 


He 


Leu 


Leu 


Asp 


Glu 


Val 


Arg 


Cys 


Thr 


Gly 


Asn 


Glu 




85 




225 










230 










235 












86 


ctg 


tea 


att 


gag 


caa 


tgt 


cca 


aag 


agt 


tec 


tgg 


ggc 


gaa 


cat 


aac 


tgt 


1009 


87 


Leu 


Ser 


He 


Glu 


Gin 


Cys 


Pro 


Lys 


Ser 


Ser 


Trp 


Gly 


Glu 


His 


Asn 


Cys 




88 


240 










245 










250 










255 




89 


ggc 


cat 


aaa 


gaa 


gat 


get 


gga 


gtg 


tct 


tgt 


gtt 


cct 


eta 


aca 


gat 


ggt 


1057 


90 


Gly 


His 


Lys 


Glu 


Asp 


Ala 


Gly 


Val 


Ser 


Cys 


Val 


Pro 


Leu 


Thr 


Asp 


Gly 




91 










260 










265 










270 






92 


gtc 


ate 


aga 


ctg 


gca 


gga 


gga 


aaa 


agt 


acc 


cat 


gaa 


ggt 


cgc 


ctg 


gag 


1105 


93 


Val 


He 


Arg 


Leu 


Ala 


Gly 


Gly 


Lys 


Ser 


Thr 


His 


Glu 


Gly 


Arg 


Leu 


Glu 




94 








275 










280 










285 









% 
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PAGE: 3 RAW SEQUENCE LISTING DATE: 11/15/1999 

PATENT APPLICATION US/09/147,947 TIME: 14:07:40 

Input Set: 1147947. RAW 



95 


cite 


tac 


tac 


aag 


yyy 


cag 


L yy 


rrrrrr 

yyy 


CLOCl 


rr t" c 


tat 

t-y t 


aat 

y CL L, 


aat 

gat 


rrrr c 

ggc 


t rrrr 

t-yy 


ar<t 


1 1 CT 
lljJ 


96 


Val 


Tvr 
a j j - 


Tvr 


Lvs 


Glv 


Gin 


Trn 


Glv 


Thr 


Val 


Cys 


Asp 


Asp 


Glv 


Trn 


Thr 




97 






290 










295 










300 










98 


rra rr 


atg 


aac 


aca 


tac 


crt*cr 

y t-y 


rr r 1 1~ 

y u l. 


tgt 


rrr a 

^*y ct 


c trr 
v> L-y 


*»* ^*y 


rrrra 

yyct 


ttt 

C U L. 


ctctct 


tar* 

L. Ct 


rrrr r 1 

yy c 




99 


Glu 


Met 


Asn 


Thr 


Tvr 
j 


Val 


Ala 




Arg 


Leu 


Leu 


Glv 


Phe 


Lys 


Tvr 


Gly 




100 




305 










3 10 










315 












101 


clclcl 


cag 


tec 


tct 


rr t rr 

y u y 


aac 


cat 


1 1 t 

ILL 


aat 
y ct i— 


rrrr r 1 

yyc 


arrp 
cty <w 


CLCLO 


agg 


CCC 


a t a 

Ct U Ct 


tgg 


1 OA Q 
X ^ *± -7 


102 


Lys 


Gin 


Ser 


Ser 


Val 


Asn 


His 


Phe 


Asp 


Glv 


Ser 


Asn 


Arg 


Pro 


He 


Trn 
j. i.p 




103 


320 










325 










33 0 










335 




104 


c tg 


y at 


rra r* 
y <x\* 


at c 

y 


cty \^ 


t rrr* 
L-y \~ 


t* r*a 

L. V_. CL 


rrrr 3 

yya 


aaa 


gaa 


y 


cty 


1 1 c 

L. L. O 


att 
clC L. 


cag 


t rrt 
i_y l. 


1 O Q *7 


105 


Leu 


Asp 


Asp 


Val 


Ser 


Cys 


Ser 


Gly 


Lys 


Glu 


Val 


Ser 


Phe 


Ile 


Gin 


Cys 




106 










340 










345 










3 50 






107 


tec 


o rrrr 


aga 


cag 


t CTCT 

c-yy 


rrrra 

yyo- 


a rrrr 

<*yy 


r»a t 


rr ^ r 1 
y ctw* 


4- rr r 1 
uy 


o rr c 
cty 


r«a t 

Oct L. 


cty d 


gaa 


rra t 
y d L. 


gtg 


1 7 A c; 


108 


Ser 


Ara 


Arg 


Gin 




Gly 




His 


Asp 


Cys 


Ser 


His 


Ai y 


Glu 




Val 




109 








355 










360 










365 








110 


yy 


etc 


acc 


tgc 


tat 


cct 


era o 
y cx 


a rr r 1 
cty 


aat 

y CL L> 


rrrr^i 

yy^ 


rat 


a rrrr 

a.yy 


rt t 


t rt 


CCS 


rr rr t" 

yy t 


i j j j 


111 


Glv 


Leu 


Thr 


Cys 


Tvr 


Pro 


Asp 


Ser 


Asp 


Glv 


His 


Ara 

«xy 


Leu 


Ser 


Pro 


Glv 




112 






370 










375 










3 80 










113 


ttt 


ccc 


ate 


aga 


eta 


rr t rr 

y L y 


aat 
y a l> 


aaa 


aaa 


aat 


ctcty 


-a rr 
ctcty 


rr 3 a 
y ctct 


rrrra 

yyc 1 


r» rr a 


rr t rr 

y^s 


1 44 1 

J-*± *± X 


114 


Phe 


Pro 


lie 


Arcr 


Leu 


Val 


Asp 


Glv 


Glu 


Asn 


Lys 


Lys 


Glu 


Glv 


Ara 


Val 




115 




385 










3 90 










9R 

J _7 -J 












116 


aaa 


att 


ttt 


gt c 


aat 


aac 

yy *- 


caa 


f- rrrr 

tyy 


rrrr ^ 

yy* 1 


a r*a 


at r 

Ct Ul* 


t ar« 
i~y V— 


aat 
y ct l> 


rra r* 
y ciw* 


rrrra 

yyci 


t rrrr 

t yy 


1/QQ 
XI O -7 


117 


Glu 


Val 


Phe 


Val 


Asn 


Glv 


Gin 




Glv 
oxy 


Thr 


He 


Cys 


Asp 


Asp 


Glv 


Tm 




118 


400 










405 










410 










415 




119 


clCC 


gat 


aag 


cat 


gca 


get 


at - a 

y u y 


ate 


tar 1 
t-y 


caa 

<-yy 


parr 


r*t t 


rrrr r' 

yyc 


tat 


a a rr 
ctcty 


rrrr t 

yy 


1 CTT 

X J J / 


120 


Thr 


Asp 


Lys 


His 


Ala 


Ala 


val 


He 


Cys 


«xy 


Gin 


Leu 


Glv 


Tvr 


Lys 


Glv 




121 










420 










425 










43 0 






122 


cct 


gec 


aga 


gca 


o-yy 


act 


atg 


get 


tat 


ttt 

u u u 


rrrrrr 

yyy 


y ctct 


rrrra 

gyci 


aaa 
ctctct 


erne 

yyc 


r* r» r 1 




123 


Pro 


Ala 


Arci 


Ala 


Arcr 


Thr 


Met 


Ala 


Tvr 


Phe 


Glv 


Glu 


Glv 


Lys 


Glv 


Pro 




124 








435 










440 










445 








125 


ate 


cac 


atg 


gat 


aat 


rr t a 

y u y 


aag 


tgc 


aca 


aaa 

yya 


aat 

del L- 


rr 3 rr 


aarr 
ctcty 


rjee 


c t rr 
uy 


rrr 1 1 
y ^ L- 


1 ^ 

X D J J 


126 


lie 


His 


Met 


Asp 


Asn 


Val 


Lys 


Cys 


Thr 


Glv 


Asn 


Glu 


Lys 


Ala 


Leu 


Ala 




127 






450 










455 










460 










128 


CfcLC 


tgt 


at c 
y l.u 


aaa 


caa 


rr ^ r* 

y en- 


att 


aaa 

yya. 


3 rrrr 

ciyy 


r 1 a r* 


a ;3 r* 
ctct I* 


tgc 


CCTC 

v^y ^ 


r*a r* 


aat 
cty l. 


rra rr 

yag 


1 R 1 

X O O X 


129 


Asp 


Cys 


Val 


Lys 


Gin 


Asp 


He 


Glv 


Arcr 


His 


Asn 


Cys 


Ara 


His 


Ser 


Glu 




130 




465 










470 










475 












131 


gat 


gca 


rrrra 

yyo. 


gt c 


ate 


fat 
uy l. 


gac 


tat 


t t 3 


rra rr 

y<*y 


cl cty 


ctcLct 


rrr'a 
y \~cl 


♦■pa 
U L^ct 


aat 
cty i— 


arrt 
cty i_ 


x / ^ y 


132 


Asp 


Ala 


Glv 


Val 


lie 


Cys 


Asp 


Tvr 

JT 


Leu 


Glu 


Lys 


Lys 


Ala 


Ser 


Ser 


Ser 




133 


480 










485 










490 










495 




134 


ggt 


aat 


aaa 


gag 


atg 


etc 


tea 


tct 


gga 


tgt 


gga 


ctg 


agg 


tta 


ctg 


cac 


1777 


135 


Gly 


Asn 


Lys 


Glu 


Met 


Leu 


Ser 


Ser 


Gly 


Cys 


Gly 


Leu 


Arg 


Leu 


Leu 


His 




136 










500 










505 










510 






137 


cgt 


egg 


cag 


aaa 


egg 


ate 


att 


ggt 


ggg 


aac 


aat 


tct 


tta 


agg 


ggt 


gec 


1825 


138 


Arg 


Arg 


Gin 


Lys 


Arg 


He 


He 


Gly 


Gly 


Asn 


Asn 


Ser 


Leu 


Arg 


Gly 


Ala 




139 








515 










520 










525 








140 


tgg 


cct 


tgg 


cag 


get 


tec 


etc 


agg 


ctg 


agg 


teg 


gee 


cat 


gga 


gac 


ggc 


1873 


141 


Trp 


Pro 


Trp 


Gin 


Ala 


Ser 


Leu 


Arg 


Leu 


Arg 


Ser 


Ala 


His 


Gly 


Asp 


Gly 




142 






530 










535 










540 










143 


agg 


ctg 


ctt 


tgt 


gga 


get 


acc 


ctt 


ctg 


agt 


age 


tgc 


tgg 


gte 


ctg 


aca 


1921 


144 


Arg 


Leu 


Leu 


Cys 


Gly 


Ala 


Thr 


Leu 


Leu 


Ser 


Ser 


Cys 


Trp 


Val 


Leu 


Thr 





% 
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PAGE: 4 RAW SEQUENCE LISTING DATE: 11/15/1999 

PATENT APPLICATION US/09/147,94.7 TIME: 14:07:40 



Input Set: 1147947. RAW 



145 






545 










550 










555 












146 




get 


gca 


cac 


tgc 


ttc 


aaa 


acres 
ctyy 


tac 


crcra 
yyo- 


aac 


aac 


L-*-*y 


a CTCT 


a fir 
cty \— 


U ex U 


y wCl 


1 QCQ 

J- J O .7 


147 




Ala 


Ala 


His 


Cvs 


Phe 


Lvs 


Arcr 


Tvr 


Glv 


Asn 


Asn 


Ser 


Arg 


Ser 


Tvr* 


Ala 




148 




560 










565 










570 










-j / 




149 




gtt 


cga 


gtt 


CTCTCT 

yyy 


gat 


tat 


cat 


act 


c tg 


gta 


cca 


cracr 

y o.y 


era cr 


ttt 


era a 
y ctct 


oa a 


2017 


150 




Val 


Arcr 


Val 


Glv 


Asp 


Tvr 
j 


His 


Thr 


Leu 


Val 


Pro 


Glu 


Glu 


Phe 


Glu 


Gin 




151 












580 










585 










U Zs \J 






152 




gaa 


ata 


yyy 


crtt 
y t- u 


caa 


cag 


att 




aM - 


zi r* 

<w Ct l_« 


«-yy 


a a f 


UCt^* 


3 CTCT 

ayy 








153 




Glu 


lie 


Glv 


Val 


Gin 


Gin 


He 


Val 


He 


His 


Arcr 


Asn 


Tvr 




Pro 


Asp 




154 










595 










600 










605 








155 




3-CJcL 


age 


gac 


tat 


gac 


att 


gee 


c tg 


gtt 


cty ct 


ttg 


caa 


rrrrPi 

yyct 


cca 


ctctct 

yyy 


gag 


2113 


156 




Arcr 


Ser 


ASD 


Tvr 


Asp 


He 


Ala 


Leu 


Val 


Arcr 


Leu 


Gin 


Glv 


Pro 


Glv 


Glu 




157 








610 










615 










620 










158 




CcLcl 


tgt 


gec 


aga 


eta 


ciy 


ace 


cac 


gtt 


ttg 


cca 


gec 


tgt 


tta 


cct 


eta 


2161 


159 




Gin 


Cys 


Ala 


Arcr 


Leu 


Ser 


Thr 


His 


Val 


Leu 


Pro 


Ala 


Cys 


Leu 


Pro 


Leu 




160 






625 










630 










635 












161 






aga 


cracr 


acrcr 
«*y y 


cca 


cag 


aaa 


aca 


gec 


tec 


aac 


tgt 


cac 


ata 


aca 


yyo. 


2209 


162 




Tro 


Arcr 


Glu 


Arcr 


Pro 


Gin 


Lys 


Thr 


Ala 


Ser 


Asn 


Cys 


His 


He 


Thr 


Glv 




163 




640 










645 










650 










655 




164 




tcrcr 


crcra 


gac 


aca 


crest' 

y y ° 


cgt 


gee 


tac 


tea 


aga 


act 


eta 


caa 


caa 


CSC t~ 


get 


2257 


165 




Tro 


Glv 


Asp 


Thr 


Glv 


Arcr 


Ala 


Tvr 


Ser 


Arcr 


Thr 


Leu 


Gin 


Gin 


Ala 


Ala 




166 












660 










665 










670 






167 




crtcr 


cct 


ctcr 
*-y 


tta 


ccc 


aag 


acrcr 

a y y 


ttt 


tgt 


aaa 


cracr 

y a y 


acrcr 

a yy 


tac 


aag 


crcra 
yyct 


eta 


2305 


168 




Val 


Pro 


Leu 


Leu 


Pro 


Lys 


Arcr 


Phe 


Cys 


Lys 


Glu 


Arcr 


Tvr 


Lys 


Glv 


Leu 




169 










675 










680 










685 








170 




ttt 


act 


crcrcr 
yyy 


aga 


atg 


etc 


tat 

uy l. 


get 


cscses 

yyy 


aac 


etc 


caa 


gaa 


gac 


aac 


cgt 


2353 


171 




Phe 


Thr 


Glv 


Arcr 


Met 


Leu 


Cys 


Ala 


Glv 


Asn 


Leu 


Gin 


Glu 


Asp 


Asn 


Arg 




172 








690 










695 










700 










173 




oho 
y u y 


gac 


acre 

W.y \~ 


tgc 


cag 


y y a 


gac 


agt 


ci era 
yya. 


crcra 
yya. 


cca 


etc 


atg 


tgt 


gaa 


aag 


2401 


174 




Val 


Asp 


Ser 


Cys 


Gin 


Gly 


Asp 


Ser 


Gly 


Gly 


Pro 


Leu 


Met 


Cys 


Glu 


Lys 




175 






705 










710 










715 












176 




cct 


gat 


gag 


tec 


tgg 


gtt 


gtg 


tat 


ggg 


gtg 


act 


tec 


tgg 


ggg 


tat 


gga 


2449 


177 




Pro 


Asp 


Glu 


Ser 


Trp 


Val 


Val 


Tyr 


Gly 


Val 


Thr 


Ser 


Trp 


Gly 


Tyr 


Gly 




178 




720 










725 










730 










735 




179 




tgt 


gga 


gtc 


aaa 


gac 


act 


cct 


gga 


gtt 


tat 


acc 


aga 


gtc 


ccc 


gee 


ttt 


^ *± j / 


180 




Cys 


Gly 


Val 


Lys 


Asp 


Thr 


Pro 


Gly 


Val 


Tyr 


Thr 


Arg 


Val 


Pro 


Ala 


Phe 




181 












740 










745 










750 






182 




gta 


cct 


tgg 


ata 


aaa 


agt 


gtc 


acc 


agt 


ctg 


taacttatgg aaagctcaag 


2547 


183 




Val 


Pro 


Trp 


He 


Lys 


Ser 


Val 


Thr 


Ser 


Leu 
















184 










755 










760 


















185 




aaaatagtaa aacagtaacc attcagtctt catacttggc 


accatgccag aaaaaaaaaa 


2607 


186 




aaaaaaa 






























2614 


187 


<210> 


SEQ 


ID NO 4 






























188 


<211> 


LENGTH : 


761 






























189 


<212> 


TYPE: PRT 






























190 


<213> 


ORGANISM: Mouse 




























191 


<220> 


FEATURE : 
































192 


<223> 


OTHER INFORMATION: 


























193 


<400> 


SEQUENCE: 4 






























194 




Met 


Ala 


Leu 


Ala 


Arg 


Cys 


Val 


Leu 


Ala 


Val 


He 


Leu Gly Ala 


Leu 


Ser 





• 



1652 
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195 


1 








5 










10 










15 




196 


Val 


Val 


Ala 


Arcr 


Ala 


Asp 


Pro 


Val 


Ser 


Aver 


Ser 


Pro 


Leu 


His 


Aver 

Ax.y 


Pro 


197 








20 










25 










30 






198 


His 


Pro 


Ser 


Pro 


Pro 


Arcr 


Ser 


Gin 


His 


Ala 


His 


Tvv 
xyx 


Leu 


Pro 


Ser 


Ser 


199 






35 










40 










45 








200 


Aver 


Aver 


Pro 


Pro 


Aver 


Thr 


Pro 


Arg 


Phe 


Pro 


Leu 


Pro 


Leu 


Aver 


He 


Pro 


201 




50 










55 










60 










202 


Ala 


Ala 


Gin 


Arg 


Pro 


Gin 


Val 


Leu 


Ser 


Thr 


Gly 


His 


Thr 


Pro 


Pro 


Thr 


203 


65 










70 










75 










80 


204 


lie 


Pro 


Aver 


Aver 


Cys 


Gly 


Ala 


Gly 


Glu 


Ser 




Gly 


Asn 


Ala 


Thr 


Asn 


205 










85 










90 










95 




206 


Leu 


Glv 


Val 


Pro 


Cys 


Leu 


His 


X J-P 


Asp 


Glu 


Val 


Pro 


Pro 


Phe 


Leu 


Glu 


207 








100 










105 










110 






208 


Arg 


Ser 


Pro 


Pro 


Ala 


Ser 




Ala 


Glu 


Leu 


Arg 


Gly 


Gin 


Pro 


His 


Asn 


209 






115 










120 










125 








210 


Phe 


Cys 


Arcr 


Ser 


Pro 


Asp 


Glv 


Ser 


Glv 


Aver 


Pro 


Tvn 


Cys 


Phe 


Tvv 
xyx. 


Aver 


211 




130 










135 










140 










212 


Asn 


Ala 


Gin 


Glv 


Lys 


Val 


Asp 


Trn 

XX, LJ 


Gly 


Tvv 
xyx 


Cys 


Asp 


Cys 


Gly 


Gin 


Gly 


213 


145 










150 










155 










160 


214 


Pro 


Ala 


Leu 


Pro 


Val 


lie 


Aver 


Leu 


Val 


Gly 


Glv 


Asn 


Ser 


Glv 


His 


Glu 


215 










165 










170 










175 




216 


Glv 


Arcr 


Val 


Glu 


Leu 


Tvr* 


His 


Ala 


Glv 

o-l y 


Gin 


Trn 

x x.p 


Glv 

oxy 


Thr 


He 


Cys 


Asp 


217 








180 










185 










190 






218 


Asp 


Gin 


Tro 


Asp 


Asn 


Ala 


Asp 


Ala 


Asp 


Val 


He 


Cys 


Aver 


Gin 


Leu 


Glv 


219 






195 










200 










205 








220 


Leu 


Ser 


Glv 


lie 


Ala 


Lys 


Ala 


Trn 

x x, Ly 


His 


Gin 


Ala 


His 


Phe 


Glv 


Glu 


Glv 


221 




210 










215 










220 










222 


Ser 


Glv 


Pro 


lie 


Leu 


Leu 


Asp 


Glu 


Val 


Aver 


Cys 


Thr 


Glv 


Asn 


Glu 


Leu 


223 


225 










230 










235 










240 


224 


Ser 


lie 


Glu 


Gin 


Cys 


Pro 


Lys 


Ser 


Ser 


TrD 

X X.£/ 


Glv 


Glu 


His 


Asn 


Cys 


Glv 

vjj_y 


225 










245 










250 










255 




226 


His 


Lys 


Glu 


Asp 


Ala 


Glv 


Val 


Ser 


Cvs 


Val 


Pro 


Leu 


Thr 


Asp 


Glv 

vjj_y 


Val 


227 








260 










265 










270 






228 


lie 


Arcr 


Leu 


Ala 


Glv 


Glv 


Lys 


Ser 


Thr 


His 


Glu 


Glv 


Aver 


Leu 


Glu 


Val 


229 






275 










280 










285 








230 


Tvr 


Tvr 

x y x. 


Lys 


Glv 


Gin 


Tm 


Gly 


Thr 


Val 


v_y o 


Asp 


Asp 


Gly 


Tvn 

Xi \J 


Thr 


Glu 


231 




290 










295 










300 










232 


Met 


Asn 


Thr 


Tvr 

x y x. 


Val 


Ala 


Cys 


Aver 


Leu 


Leu 


Glv 


Phe 


Lys 


Tvv 


Glv 


Lys 


233 


305 










310 










315 










320 


234 


Gin 


Ser 


Ser 


Val 


Asn 


His 


Phe 


Asp 


Gly 


Ser 


Asn 


Arg 


Pro 


He 


Trp 


Leu 


235 










325 










330 










335 




236 


Asp 


Asp 


Val 


Ser 


Cys 


Ser 


Gly 


Lys 


Glu 


Val 


Ser 


Phe 


lie 


Gin 


Cys 


Ser 


237 








340 










345 










350 






238 


Arg 


Arg 


Gin 


Trp 


Gly 


Arg 


His 


Asp 


Cys 


Ser 


His 


Arg 


Glu 


Asp 


Val 


Gly 


239 






355 










360 










365 








240 


Leu 


Thr 


Cys 


Tyr 


Pro 


Asp 


Ser 


Asp 


Gly 


His 


Arg 


Leu 


Ser 


Pro 


Gly 


Phe 


241 




370 










375 










380 










242 


Pro 


lie 


Arg 


Leu 


Val 


Asp 


Gly 


Glu 


Asn 


Lys 


Lys 


Glu 


Gly 


Arg 


Val 


Glu 


243 


385 










390 










395 










400 


244 


Val 


Phe 


Val 


Asn 


Gly 


Gin 


Trp 


Gly 


Thr 


He 


Cys 


Asp 


Asp 


Gly 


Trp 


Thr 
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Line ? Error/Warning Original Text 



21 W "N" or "Xaa" used: Feature required gtgctcacng cngcbcaytg 

29 W "N" or "Xaa" used: Feature required agcggnccnc cdgartcvcc 



